RANDOM PERMUTATIONS AND RELATED TOPICS 



GRIGORI OLSHANSKI 

Abstract. We present an overview of selected topics in random permutations 
and random partitions highlighting analogies with random matrix theory. 
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1. Introduction 

An ensemble of random permutations is determined by a probability distribution 
on Sn, the set of permutations of [n] := {1,2, . . . ,n}. Even the simplest instance 
of the uniform probability distribution is already very interesting and leads to a 
deep theory. The symmetric group Sn is in many ways linked to classical matrix 
groups, and ensembles of random permutations should be treated on equal footing 
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with random matrix ensembles, such as the ensembles of classical compact groups 
and symmetric spaces of compact type with the normalized invariant measure. The 
role of matrix eigenvalues is then played by partitions of n that parameterize the 
conjugacy classes in Sn- The parallelism with random matrices becomes especially 
striking in applications to constructing representations of "big groups" — inductive 
limits of symmetric or classical compact groups. 

The theses stated above are developed in Section [2l The two main themes of 
this section are the space (3 of virtual permutations (© is a counterpart of the 
space of Hermitian matrices of infinite size) and the Poisson-Dirichlet distributions 
(a remarkable family of infinite-dimensional probability distributions). We focus 
on a special family of probability distributions on Sn with nice properties, the so- 
called Ewens measures (they contain the uniform distributions as a particular case). 
It turns out that the large-n limits of the Ewens measures can be interpreted as 
probability measures on ©. On the other hand, the Ewens measures give rise to 
ensembles of random partitions, from which one gets, in a limit transition, the 
Poisson-Dirichlet distributions. 

A remarkable discovery of Frobenius, the founder of representation theory, was 
that partitions of n not only parameterize conjugacy classes in Sn but also serve 
as natural coordinates in the dual space Sn — the set of equivalence classes of 
irreducible representations of Sn- This fact forms the basis of a "dual" theory of 
random partitions, which turns out to have many intersections with random matrix 
theory. This is the subject of Sections ISHU Here we survey results related to the 
Plancherel measure on Sn and its consecutive generalizations: the z-measures and 
the Schur measures. 

Thus, the two-faced nature of partitions gives rise to two kinds of probabilistic 
models. At first glance, they seem to be weakly related, but under a more general 
approach one sees a bridge between them. The idea is that the probability measures 
in the "dual picture" can be further generalized by introducing an additional pa- 
rameter, which is an exact counterpart of the P parameter in random matrix theory. 
This parameter interpolates between the group level and the dual space level, in the 
sense that in the limit as /3 ^ 0, the "beta z-measures" degenerate to the measures 
on partitions derived from the Ewens measures, see Section 14.41 below. 

2. The Ewens measures, virtual permutations, and the 
Poisson-Dirichlet distributions 

2.1. The Ewens measures. Permutations s E Sn can be represented as n x n 
unitary matrices [sjj], where Sij equals 1 if s(j) = and otherwise. This makes 
it possible to view random permutations as a very special case of random unitary 
matrices. 

Given a probability distribution on each set S^ n = 1,2,..., one may speak 
of a sequence of ensembles of random permutations and study their asymptotic 
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properties as n oo. The simplest yet fundamental example of a probability 
distribution on Sn is the uniform distribution P^"'\ which gives equal weights l/n\ to 
all permutations s ^ Sn', this is also the normalized Haar measure on the symmetric 
group. However, a more complete picture is achieved by considering a one-parameter 
family of distributions, {Pg"^}6i>o, forming a deformation of p("): 

where i{s) denotes the number of cycles in s (the uniform distribution corresponds 
to ^ = 1). 

For reasons explained below we call P^"^ the Ewens measure on the symmetric 
group Sn with parameter 6. Obviously, the Ewens measures are invariant under the 
action of Sn on itself by conjugations. 

We propose to think of (S'„,P*^"^) as of an analogue of CUEtv, Dyson's circular 
unitary ensemble [ForlOaj formed by the unitary group U{N) endowed with the 
normalized Haar measure P^(^). More generally, the Ewens family {Pg^^} should 
be viewed as a counterpart of a family of probability distributions on the unitary 
group U{N) forming a deformation of the Haar measure: 

Pt^''\dU) = const I det((l + f/)*)|2p^(^)(df/), U G U{N), (2.2) 

where t is a complex parameter. Ret > — | (the Haar measure corresponds tot = 0). 
Using the Cayley transform one can identify the manifold U{N), within a negligible 
subset, with the flat space H{N) of N x N Hermitian matrices; then p^^'-^-' turns 
into the so-called Hua-Pickrell measure on H{N): 

Pl^^^\dX) = const I det(l + iX)-'-^\^dX, (2.3) 

where "dX" in the right-hand side denotes Lebesgue measure. For more detail, 
see [BorOlbj , jNi?02] , |Ols03aj . A similarity between Pj"^ and Pf (or Pf ^^^) is 
exploited in |Bou07] . 

A fundamental property of the Ewens measures is their consistency with respect 
to some natural projections Sn — ?■ Sn-i that we are going to describe: 

Given a permutation s G Sn-, the derived permutation s' G Sn-i sends each i = 
1, . . . ,n — 1 either to s{i) or to s{n), depending on whether s{i) ^ n or s{i) = n. In 
other words, s' is obtained by removing n from the cycle of s that contains n. For 
instance, if s = (153) (24) (meaning that one cycle insisl— t-S— 7'3— J-l and the 
other is 2 — )■ 4 — )• 2), then s' = (13) (24). The map Sn — ?• •S'n-i defined in this way is 
called the canonical projection |Ker04] and denoted as pn-i,n- For n > 5, it can be 
characterized as the only map Sn — ^ >S'„_i commuting with the two-sided action of 
the subgroup Sn-i- 

The next assertion |Ker04] is readily verified: 
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Lemma 2.1. For any n = 2,3, . . . , the push-forward of Pq under pn^i^n coincides 
withP^''-^\ 

The Hua-Pickrell measures enjoy a similar consistency property with respect to 
natural projections p%_ 

-1 N '■ ^i-^) ~^ H{N — 1) (removal of the iVth row and column 
from an X iV matrix) |Bor01b] . |Ols03a] . This fact is hidden in the old book by 
Hua Lokeng |Hua58j . in his computation of the matrix integral 

[ det{l + X^y'dX (2.4) 
Jh{n) 

by induction on A^. (Note that |Hua58] contains a lot of masterly computations of 
matrix integrals.) Much later, the consistency property was rediscovered and ap- 
plied to constructing measures on infinite-dimensional spaces by Shimomura |Shi75] . 
Pickrell |Pic87j . and Neretin |Ner02j . Note that analogues of the projections p^_i 
can be defined for other matrix spaces including the three series of compact classical 
groups and, more generally, the ten series of classical compact symmetric spaces 
|Ner02j . 

2.2. Virtual permutations, central measures, and Kingman's theorem. As 
we will see, the consistency property of the Ewens measures P^"^ makes it possible 
to build an "n = oo" version of these measures. In the particular case 6 = 1, this 
leads to a concept of "uniformly distributed infinite permutations" . 

Let & = hmS'„ be the projective limit of the sets Sn taken with respect to the 
canonical projections. By the very definition, each element a G (3 is a sequence 
{cr„ G Sn}n=i,2,... such that (T„_i = Pn-i,n(c"n) for any n = 2,3, . . . . We call & the 
space of virtual permutations |Ker04] . It is a compact topological space with respect 
to the projective limit topology. 

By classical Kolmogorov's theorem, any family of probability measures on 

the groups Sn, consistent with the canonical projections, gives rise to a probability 
measure V = hmP^"^ on the space &. Taking V^"'^ = Pg'^\ 6 > 0, we get some 
measures Pg on & with nice properties; we still call them the Ewens measures. 

A parallel construction exists in the context of matrix spaces. In particular, 
by making use of the projections p^_ 

-1 Af' define a projective limit space 

Sj := \im H{N), which is simply the space of all Hermitian matrices of infinite 

size. This space carries the measures P^-* := ^imP^ , Ret > Using the 

Cayley transform, the measures P/-* can be transformed to some measures P^ on a 
projective limit space It := l^m U (N). 

As is argued in |Bor01b] . the t = case of the probability space {Sj, P^) ~ (it, Pf') 
may be viewed as an 'W = oo" version of CUEat. Likewise, we regard (6, Pi) as an 
"n = oo" version of the finite uniform measure space {Sn, P'f'^). 

A crucial property of CUEjv is its invariance under the action of the unitary group 
U{N) on itself by conjugation. This property is shared by the deformed ensemble 
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with parameter t as well. In the infinite- dimensional case, the analogous property 
is C/(oo)-invariance of the measures P^; here by U{oo) we mean the inductive limit 
group \m^U{N), which acts on the space Sj by conjugations. 

Likewise, define the infinite symmetric group Soo as the inductive limit lin^S'„, or 
simply as union of the finite symmetric groups Sn, where each group Sn is identified 
with the subgroup of ^^^-i fixing the point n + 1 of the set [n + 1]. The group 
5*00 is countable and can be realized as the group of all permutations of the set 
N = {1,2, . . . } moving only finitely many points. The space (3 contains Sqo as a 
dense subset, and the action of the group 6*00 on itself by conjugation extends by 
continuity to the space 6; we will still call the latter action the conjugation action 
of Soo- 

It is convenient to give a name to measures that are invariant under the conju- 
gation action of Sn or 5*00; let us call such measures central ones. Now, a simple 
lemma says: 

Lemma 2.2. Let, as above, V = lim'P*^") be a projective limit probability measure 
on G. IfP^"^^ is central for each n, then so is V. 



As a consequence we get that the measures Pq are central. 

In a variety of random matrix problems, the invariance property under an appro- 
priate group action makes it possible to pass from matrices to their eigenvalues or 
singular values. For random permutations from Sn directed by a central measure, a 
natural substitute of eigenvalues is another invariant — partitions of n parameter- 
izing the cycle structure of permutations. 

In combinatorics, by a partition one means a sequence p = (pi, p2, • • • ) of weakly 
decreasing nonnegative integers with infinitely many terms and finite sum \p\ 
^ Pj. Of course, the number of nonzero terms pi in p is always finite; it is denoted 
by i'(p). The finite set of all partitions p with \p\ = n will be denoted as ^{n). To 
a permutation s G Sn we assign a partition p e ^(n) (in words, a partition of n) 
comprised by the cycle-lengths of s written in weakly decreasing order. Obviously, 
p is a full invariant of the conjugacy class of s. The projection s ^ p takes any 
probability measure on Sn to a probability measure on f^{n). Now, the point is that 
this establishes a one-to-one correspondence V'^"'^ -H- 11*^"^ between arbitrary central 
probability measures V^"'^ on Sn and arbitrary probability measures 

n(«) on ^(n). 

In this sense, random permutations s G Sn (directed by a central measure) may be 
replaced by random partitions p G I^{n). 

The link between P*^") and 11*^") is simple: Given p G ^{n), let C(p) C Sn denote 
the corresponding conjugacy class in Sn- Then Il^^\p) = \C{p)\V^^\s) for any 
s G C(p). Further, there is an explicit expression for |C(p|: it is equal to n\/zp, 
where -Zp = H and rrik stands for the multiplicity of A; = 1, 2, . . . in p. 
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In this notation, the measure 11^" corresponding to the Ewens measure 

p(n) = 

Pg"^ is given by the expression 

widely known under the name of Ewens sampling formula |Ewe98] . This justifies 
the name given to the measure (12. ip . 

The following result provides a highly nontrivial "ra = oo" version of the evident 
correspondence P*^"-* f-)- 11^"): 

Theorem 2.3. There exists a natural one-to-one correspondence V ^ H between 
arbitrary central probability measures V on the space & of virtual permutations and 
arbitrary probability measures 11 on the space 

Voo := {(a;i,X2,...) e [0,1]°": xi >a;2 > 5Zxi<l}. (2.6) 

In other words, each central measure P on © is uniquely representable as a mixture 
of indecomposable (or ergodic) central measures, which in turn are parameterized by 
the points of Voo- The measure 11 assigned to V is just the mixing measure. 

Idea of proof. The theorem is a reformulation of celebrated Kingman's theorem, see 
|Kin78b] . Kingman did not deal with virtual permutations but worked with some 
sequences of random permutations that he called partition structures. Represent V 
clS db projective limit measure, P = 1^P("). By Lemma [221 all measures V^"'' are 
central. Pass to the corresponding measures II*^"^ on partitions. The consistency 
of the family with the canonical projections Sn — )■ Sn-i then translates as 

the consistency of the family {11*^")} with some canonical Markov transition kernels 
^(n) — )■ ^{n — 1). In Kingman's language this just means that {p(")} is a partition 
structure. Kingman's theorem provides a kind of Poisson integral representation of 
partition structures via probability measures on V, which is equivalent to the claim 
of Theorem 12.31 □ 

Other proofs of Kingman's theorem can be found in |Ker03] . |Ker98j . where this 
result is placed in the broader context of potential theory for branching graphs. For 
our purpose it is worth emphasizing that the claim of Theorem 12.31 has a counterpart 
in the random matrix context — description of f/(cxD)-invariant probability measures 
on S}, which in turn is equivalent to classical Schoenberg's theorem on totally positive 
functions |Sch51] . jPlcOl] . ps96] . _ 

From the proof of Kingman's theorem it is seen that the space Voo arises as a large- 
n limit of finite sets ^(n), and every measure 11 can be interpreted as a limit of the 
corresponding measures 11^") . In this picture, the ergodic measures Il(xi,x2,... ) that are 
parameterized by points {xi,X2, . . . ) G Voo arise as limits of uniform distributions 
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on conjugacy classes C(p) in growing finite symmetric groups. Thus, it is tempting 
to regard the Il(^-j:i,x2,...)'^ ^ substitute of those uniform measures. 

2.3. Application to representation theory. Pickrell's pioneer work |Pic87] demon- 
strated how some non-Gaussian measures on infinite-dimensional matrix spaces can 
be employed in representation theory. The idea of |Pic87j was further developed in 
|Ols03a] . As shown there, the measures Pf' on (or, equivalently, the measures 

on it) meet a lack of the Haar measure in the infinite-dimensional situation and can 
be applied to the construction of some "generalized (bi)regular representations" of 
the group f/(oo) x U{oo). 

Pickrell's work was also a starting point for a parallel theory for the group 5*00, 
|Ker93cj . [Ker04j . The key point is that the Ewens measures Pq on & have good 
transformation properties with respect to a natural action of the group Sqq X Sqq 
on & extending the two-sided action of 5*00 on itself. Namely, the measure Pi is 
5'oo X 5'oo-invariant and is the only probability measure on & with such a property, 
so that it may be viewed as a substitute of the uniform distribution on Sn- As for 
the Ewens measures Pe with general ^ > 0, they turn out to be quasi-invariant with 
respect to the action of Soo x 5*00. The quasi-invariance property forms the basis of 
the construction of some "generalized (bi)regular representations" of the group 
•S'oo X 'S'oo. Here z is a parameter ranging over C, and the Hilbert space of is 
L^{e,P\,\2). We refer to |Ker04] and |Uls03bj for details. 

2.4. Poisson— Dirichlet distributions. The probability measures on Voo assigned 
by Theorem 12.31 to the Ewens measures Pe are known under the name of Poisson- 
Dirichlet distributions; denote them by PD{6). Continuing our juxtaposition of 
the Ewens measures and the Hua-Pickrell measures one may say that the Poisson- 
Dirichlet distributions are counterparts of the determinantal point processes direct- 
ing the decomposition of the measures Pf' into ergodic components (those processes 
involve, in a slightly disguised form, the sine-kernel process, see |Bor01bj ). Although 
the Poisson-Dirichlet distributions PD{6) seem to be much simpler than the sine- 
kernel process, they are still very interesting objects with a rich structure. Below 
we list a few equivalent descriptions of the PD(^)'s: 

(a) Projection of a Poisson process. Let P{6) denote the inhomogeneous Poisson 
process on the half-line M>o := {r G M | r > 0} with intensity 6T~^e~'^, and let 
y — {Ui} be the random point configuration on M>o with law P{0). Due to the 
fast decay of the intensity at oo, the configuration y is almost surely bounded from 
above, so that we may arrange the i/i's in weakly decreasing order: yi > 1/2 > ■ ■ ■ > 0. 
Furthermore, the sum r := YlVi is almost surely finite. Finally, it turns out that 
r and the normalized vector x := {yi/r,y2/r, . . .) G Voo are independent from 
each other, the random variable r has the gamma distribution on M>o with density 
{r{9))^H^^^ exp{—t), and the random vector x is distributed according to PD{6). 
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This means, in particular, that PD{0) arises as the push-forward of the Poisson 
process P{0) under the projection y x. 

(b) Limit of Dirichlet distributions |Kin75j . Let Dn{0) denote the probabihty 
distribution on the {n — l)-dimensional simplex 

A„ := {(xi, . . . , x„) G M" I xi, . . . , a;„ > 0, ^ = 1} 

with the density proportional to Y[^7 ^^^^ (with respect to Lebesgue measure on 
A„). Note that Dn{d) enters the family of the Dirichlet distributions. Rearranging 
the coordinates Xi in weakly decreasing order and adding infinitely many O's one 
gets a map A„ — t- Voo! let Dn{9) stand for the push- forward of Dn{9) under this 
map. Then PD{9) appears as the weak limit of the measures Dn{9) as n — > oo. 

(c) Projection of a product measure |Ver77aj . [ArrOSj §4.11]. Consider the infinite- 
dimensional simplex 

Aoo := {(mi,M2,...) e [0,1]°° I + + < 1}. 

The triangular transformation of coordinates v = (fi,f2, . . .) ^ u = {ui,U2, ■ ■ ■) 
given by 

Ui=Vi] Un = Vn{l - Vi) . . . {1 - Vn^l), U > 2, 

maps the cube [0, 1]°° onto the simplex Aqo- This map is almost one-to-one: it 
admits the inversion u v, 

vi=ui; Vn = - — , n > 2, (2.7) 

1 - Ml Un-l 

which is well defined provided that all the partial sums of the series ui + U2 + . . . 
are strictly less than 1. 

Next, the rearrangement of coordinates in weakly decreasing order determines a 
projection Aqo — > Voo- 

Denoting by B{6) the probability measure on [0, 1]°° obtained as the product of 
infinitely many copies of the measure 6(1 - ty-^dt on [0, 1], the Poisson-Dirichlet 
distribution PD{6) coincides with the push- forward of B{6) under the composition 
map [0, 1]°° ^ Aoo ^ Voo. 

(d) Characterization via correlation functions |Wat76l §3]. Removing possible O's 
from a sequence x G Voo one may interpret it as a locally finite point configuration 
on the semi-open interval (0, 1]. This allows one to interpret any probability measure 
on Voo as a random point process on (0, 1] (see [BorlO] for basic definitions). It turns 
out that the correlation functions of the point process associated to PD{0) have a 
very simple form: 

^^"(1-^1 UnY^' ^ 

, 2^ Mi < 1; 

Ui...Un i=l 

0, otherwise. 
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This provides one more characterization of PD{9). 

The hterature devoted to the Poisson-Dirichlet distributions and their various 
connections and apphcations is very large. The interested reader will find a rich ma- 
terial in |Arr03j . |Ver72j . |Wat76j . |Ver77aj . |Ver78j . [IgE82] , jPEOT] . |Hol01j |Kin75j . 

Note that PD{9) describes the asymptotics of the large cycle- lengths of random 
permutations with law Pg""* (namely, the ith coordinate Xi on Voo corresponds to 
the ith largest cycle-length scaled by the factor of 1/n). The literature also contains 
results concerning the asymptotics of other statistics on random permutations, for 
instance, small cycle-lengths and the number of cycles |Arr03j . 

3. The Plancherel measure 

3.1. Definition of the Plancherel measure. Partitions parameterize not only the 
conjugacy classes in the symmetric groups but also their irreducible representations. 
So far we focused on the conjugacy classes, but now we will exploit the connection 
with representations. It is convenient to identify partitions of n with Young diagrams 
containing n boxes. The set of such diagrams will be denoted as Y„. Given a 
diagram A e Y„, let V\ denote the corresponding irreducible representation of Sn 
and dim A its dimension. In particular, the one-row diagram A = (n) and the one- 
column diagram A = (1") correspond to the only one-dimensional representations, 
the trivial and the sign ones. Note that the symmetry map Y„ — )■ Y„ given by 
transposition A A' amounts to tensoring V\ with the sign representation, so that 
dim A' = dim A. 

By virtue of Burnside's theorem, 

^(dimA)2 = n!. (3.1) 

AeY„ 

This suggests the definition of a probability distribution M^") on Yn- 

M(")(A):=^^^^, AgY„. (3.2) 
n! 

Following |Ver77b] . one calls M*^"-* the Plancherel measure on Y„. 

In purely combinatorial terms, dim A equals the number of standard tableaux of 
shape A |Sag01 Section 2.5]. Several explicit expressions for dim A are known, see 



ISagOll Sections 3.10, 3.11] 



3.2. Limit shape and Gaussian fluctuations. We view each A G Y„ as a plane 
shape, of area n, in the (r, s) plane, where r is the row coordinate and s the column 
coordinate. In new coordinates x = s — r,y = r + s, the boundary dX of the shape 
A C may be viewed as the graph of a continuous piecewise linear function, which 
we denote as y = A(x). Note that A'(x) = ±1, and A(x) coincides with \x\ for 

^In the context of conjugacy classes the operation of transposition has no natural interpretation. 
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sufficiently large values of |x|. The area of the shape \x\ <y < A(x) equals 2n. (See 
Figure [H) 



y 




Figure 1. The function y = A(x) for the Young diagram A = (4, 2, 1). 

Assuming A to be the random diagram distributed according to the Plancherel 
measure we get a random ensemble {A( ■ )} of polygonal lines. We will describe 

the behavior of this ensemble as n — )■ oo. 

Informally, the result can be stated as follows: Let y = X{x) be obtained from 
y = X{x) by shrinking along both the x and y axes with coefficient ^/n, 

X{x) = X(^/nx)■, 
'n 



then we have 

2 

Xix) ^ VLix) + —=^ix), n^oo, (3.3) 
'n 



where y = Q{x) is a certain nonrandom curve coinciding with y = \x\ outside 
[—2,2] C M, and A(x) is a generalized Gaussian process. Let us explain the exact 
meaning of (13. 3p . 

First of all, the purpose of the scaling A( • ) — t- A( ■ ) is to put the random ensembles 
with varying n on the same scale: note that the area of the shape \x\ < y < X{x) 
equals 2 for any n. 

The function y = Q{x) is given by two different expressions depending on whether 
or not X belongs to the interval [—2, 2] C M: 



n{x) 



— f If QTT'CTn — _L A / A 



x arcsm 



+ V4-a;2), < 2 
Ixl > 2 



In the ffist approximation, the asymptotic relation (13.31) means concentration of 
the random polygonal lines y = X{x) near a limit curve. The exact statement (see 
Log77| , [Ver77b] . and also |Iva02p is: 



Theorem 3.1 (Law of large numbers). For each n = 1,2, ... , let X G Y„ be the 
random Plancherel diagram and A( ■ ) be the corresponding random curve, as defined 
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above. As n ^ oo, the distance in the uniform metric between A( • ) and the curve 
Q{x) tends to in probability: 

lim M("UA G Y„ I sup |A(x) - n{x)\ < e} = 1, Ve > 0. 

The second term in the right-hand side of (13. 3p describes the fluctuations around 
the hmit curve. The Gaussian process A{x) can be defined by a random trigono- 
metric series on the interval [—2,2] C M, as follows. Let ^2,^3; • • • be independent 
Gaussian random variables with mean and variance 1, and set x = 2cos^, where 
0<9 <n. Then 

oo . 

A(x) = A(2cose) = -J2^ sm{ke), x E [-2,2]. 



This is a generalized process, meaning that its trajectories are not ordinary functions 
but generalized ones (i.e., distributions). In other words, it is a Gaussian measure 
on the space of distributions supported by [—2,2]. For any smooth test function ip 
on R, the smoothed series 

— \^ —= / sm{k9)(p{x)dx, 6 = arccos(a;/2), 
^ f,Vk J-2 

converges and represents a Gaussian random variable. However, the value of A(x) 
at a point x is not defined. 

More precisely, the result about the Gaussian fluctuations looks as follows: 

Theorem 3.2 (Central limit theorem for global fluctuations). Let, as above, {A( ■ )} 
be the random ensemble governed by the Plancherel measure M("\ and set 



An{x) = ^{\{x)-n{x)), XGM. 

For any finite collection of polynomials ^i{x), . . . ,ipm{x), the joint distribution of 
the random variables 



(pi{x)An{x)dx, 1 < i < m (3.4) 
converges, as n ^ oo, to that of the Gaussian random variables 

(Pi{x)A{x)dx, 1 < i < m. 



This result is due to Kerov |Ker93a] : a detailed exposition is given in |Iva02j . 

Note that for any diagram A, the function X{x) — Q{x) vanishes for |x| large 
enough, so that the integral in (13.41) makes sense. 

The theorem implies that the normalized fluctuations An{x), when appropriately 
smoothed, are of finite order. This can be rephrased by saying that, in the (r, s) 
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coordinates, the global fluctuations of the boundary dX of the random Plancherel 
diagram A G Y„ in the direction parallel to the diagonal r = s have finite order. 
A different central limit theorem is stated in |Bog07 : that result describes fluctu- 



ations at points (so that there is no smoothing); then an additional scaling of order 
y/\ogn along the y-axis is required. 

Theorem 13.11 should be compared to a similar concentration result for spectra of 
random matrices (convergence to Wigner's semicircle law). A similarity between 
the two pictures becomes especially convincing in view of the fact (discovered in 
|Ker93bp that there is a natural transform relating the curve f2 to the semicircle 
law. As for Theorem 13. 2[ it has a strong resemblance to the central limit theorems 
for random matrix ensembles, established in |Dia94] . |Joh98] . 

Biane [BiaOlj considered a modification of the Plancherel measures M(") related 
to the Schur-Weyl duality and found a one-parameter family of limit curves forming 
a deformation of Q. 

3.3. The poissonized Plancherel measure as a determinantal process. Let 

Y = Yq U Yi U . . . be the countable set of all Young diagrams including the empty 
diagram 0. To each A G Y we assign an infinite subset C{X) on the lattice Z' := Z+| 
of half-integers, as follows 

£(A) = {Ai-z + ll z = 1,2,...}. (3.5) 

We interpret C{X) as a particle configuration on the nodes of the lattice Z' and 
regard the unoccupied nodes Z' \ £(A) as holes. In particular, the configuration 
£(0) is Z'_ :={..., — |, — |} and the corresponding holes occupy Z'_^ := {|, |, . . . }. 
In this picture, appending a box to a diagram A results in moving a particle from 
£(A) to the neighboring position on the right. Thus, growing A from the empty 
diagram by consecutively appending a box can be interpreted as a passage from 
the configuration Z'_ to the configuration C{X) by moving at each step one of the 
particles to the right by 1. 

The configurations C{X) are precisely those configurations for which the number 
of particles on Z'_,_ is finite and equal to the number of holes on Z'_. Note also 
that transposition A — A' translates as replacing particles by holes and vice versa, 
combined with the reflection map x — )■ —x on Z'. 

The poissonized Plancherel measure with parameter u > jBai99] is a probability 
measure Mi, on Y, which is obtained by mixing together the measures M*^"-' (see 
(13. 2p ). n = 0,l,2,...,by means of a Poisson distribution on the set of indices n: 

M.(A) = e-'^j^M(l^l)(A) = e-^z/l^l (^)', AgY 



(see also |BorlOt Sect. 1.6]). 
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Theorem 3.3. Under the correspondence A 'C(A) defined by ( \3.5\\ . the poissonized 
Plancherel measure My turns into the determinantal point process on the lattice Z' 
whose correlation kernel is the discrete Bessel kernel. 

About determinantal point processes in general, see [BorlO] . The discrete Bessel 
kernel is written down in |BorlOt Sect. 11.6] (replace there 6 by v^)- Note that 
it is a projection kernel. Theorem 13.31 was obtained in |Joh01a] and (in a slightly 
different form) in [BorOObj . Johansson's approach |Joh01a] is also discussed in his 
note |Joh01b] and the expository paper [Joh05] . 

3.4. The bulk limit. (See |BorOObj .) Fix a G (-2,2). Recall that the point 
(a, fl{a)) on the limit curve y = Q{x) (see Section corresponds to the intersection 
of the boundary dX of the typical large Plancherel diagram A G Y„ with the line 
j — i = a^fn. The next result describes the asymptotic behavior of the boundary 
d\ near this point. 

Theorem 3.4. Assume that n ^ oo and x{n) G Z' varies together with n in such 
a way that x(n)l^fn — )■ a G (—2,2). Let A G Y„ 6e the random diagram with law 
given by ( 13. 2p and let X„ be the random particle configuration on Z obtained 
from the configuration C{X) defined by (13. 5p under the shift x ^ x — x{n) mapping 
II onto Z. Then X„ converges to a translation invariant point process on Z, with 
the correlation kernel 



The kernel S'"(fc, /) is called the discrete sine kernel. It is a projection kernel and 
should be viewed as a lattice analogue of the famous sine kernel on M originated 
in random matrix theory. Like the sine kernel, the discrete sine kernel possesses a 
universality property |Bai07j . 

Theorem 13.41 is derived from Theorem 13.31 Let J'^(x, y) denote the discrete Bessel 
kernel; one shows that 



and then one applies a depoissonization argument to check that the large-n limit 
and the large- limit are equivalent. 

3.5. The edge limit. 

Theorem 3.5. Let A = (Ai,A2,...) G Y„ be distributed according to the nth 
Plancherel measure given by (13. 2p . For any fixed k = 1, 2, ... , introduce real- 
valued random variables Ui, . . . ,Uk by setting 




71 



lim r{x{u) + k, x(z/) + /) = S%k, I) 



x(z/) G Z', x^u) ~ Oa/z/, 



1/6 



I = 



l,...,k. 



(3.6) 
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Then, as n oo, the joint distribution of Ui, . . . , Uk converges to that of the first k 
particles in the Airy point process. 

Recall ( [BorlOt Sect. 1.9]) that the Airy point process is a determinantal process 
on M living on point configurations {ui > U2 > • • • ) bounded from above; it is 
determined by the Airy correlation kernel, which is a projection kernel on M. 

The Airy point process arises in the edge limit transition from a large class of 
random matrix ensembles. It turns out that it also describes the limit distribution 
of a few (appropriately scaled) largest rows of the random Plancherel diagram. 

Due to symmetry of M*^"^ under transposition A — )■ A', the same result holds for 
the largest column lengths as well. 

Already the simplest case = 1 of Theorem 13.51 is very interesting, especially 
because of its connection to longest increasing subsequences in random permutations 
(see Section [3.61 below). The claim for A; = 1 was first established by Baik, Deift, 
and Johansson |Bai99| : then they proved the claim for k = 2, |BaiOO] . Their work 
completed a long series of investigations and at the same time opened the way to 
generalizations. The general case of Theorem 13.51 is due to Okounkov |OkoOO] ; note 
that his approach is very different from that of |Bai99] , [BaiOO] . Shortly afterwards, 
the theorem was obtained by yet another method in independent papers [BorOObj 
and [JohOla] , by using Theorem 13.31 as an intermediate step. Note that once one 
knows Theorem 13. 3^ the precise form of the scaling (13. 6p can be guessed by a simple 
argument, see jOlsOS j. 

3.6. Longest increasing subsequences. Given a permutation s G Sn, let Ln{s) 
stand for the length of the longest increasing subsequence in the permutation word 
s := s(l)s(2) . . . s{n). @ Under the uniform distribution on Sn, Ln becomes a random 
variable. In the sixties, S. Ulam raised the question about its asymptotic properties 
as n — )■ 00. This seemingly rather particular problem turned out to be surprisingly 
deep (about the history of the problem and many related results, see |Bai99j and the 
survey papers |Ald99j , [DeiOOj , |Sta07j ) . The next claim relates L„ to the Plancherel 
measure M^"); 

Theorem 3.6. The distribution of Ln under the uniform measure on Sn coincides 
with the distribution of Xi, the first row length of the random Young diagram A G Y, 
with law M^*") . 



n 



This result is obtained with the help of the Robins on- Schensted correspondence, 
which establishes an explicit bijection RS : s -H- {V, Q) between permutations s G 
and couples {V, Q) of standard tableaux of one and the same shape A G Y„. The 
bijection RS is described in detail in many textbooks, e.g., jFul97j and [SagOl]. The 



latter book also contains an elegant geometric interpretation of RS due to Viennot. 



2An increasing subsequence in s is a subword s{ii) . . . s{ik) such that ii < ■ ■ ■ < ik and s(zi) < 
• < s{ik)- 
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By the very definitions, the push-forward under RS of the uniform measure on Sn 
is M^^\ A nontrivial fact is that under this bijection, = Ai. 

By virtue of Theorem I3.6[ the Ulam problem is completely solved by the k = 1 
case of Theorem 13.51 discussed above: the limit distribution of the scaled random 
variable (L„ — 2y/n)n~^^^ is the GUE Tracy- Widom distribution F2 |Tra94] . 

Given a subset S* C Sn, denote by L* the random variable Ln{-) directed by the 
uniform measure on S**. A modification of the Ulam problem consists in studying the 
limit distribution of L* (suitably centered and scaled) for subsets S* determined by 
certain symmetry conditions imposed on the matrix [sij] of a permutation s & Sn- 
Baik and Rains (see ^BaiOlj and references therein) showed that in this way one 
can get two other Tracy- Widom distributions |Tra96j . Fi and F4, as well as a large 
family of allied probability distributions including an interpolation between Fi and 
F4. These results demonstrate once again a similarity in asymptotic properties of 
random permutations and random matrices. Here is the simplest example from 
[BaiOlj . which shows that involutions s = in Sn (i.e., symmetric permutation 
matrices) model real symmetric matrices: 

Theorem 3.7. Take as 5** the subset of involutions in Sn, and let L* be the cor- 
responding random variable. Then the limit distribution of (L* — 2y/n)n^^^^ is the 
GOE Tracy- Widom distribution Fi. 

4. The z-measures and Schur measures 

4.1. The z-measures. The identity (13. ip admits an extension depending on two 
parameters z, z' G C: 

Y,{zUz'U^m.Xf = {zz')nn\, 
AeY„ 

where (x)„ := x(x + 1) . . . (x + n — 1) is the Pochhammer symbol and [x)\ is its 
generalization, 

{^)\ ■= n (^+-^ 

{i,i)eA 

the product taken over the boxes belonging to A, where i and j stand for the 
row and column number of a box. The (complex-valued) z-measure Mf3 on 
assigns weights 

Mi") (A) = (f^(f)^ M(")(A) = 

{zz')n {zz')n n\ 

to diagrams A G Y„. This is a deformation of the Plancherel measure M^") in the 
sense that MfJ(A) ^ M(")(A) as 2;, z' — 00. In what follows we assume that the 
parameters take admissible values meaning that {z)\{z')\ > for any A G Y and 
zz' > (for instance, one may assume 2;' = ^ G C \ {0}). Then M"^, is a probability 
measure for every n. 
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The z- measures first emerged in |Ker93cj : they play an important role in the 
representation theory of the infinite symmetric group Soo'- Recall that in Section [53] 
we have mentioned generalized regular representations T^; it turns out that when 
z' = z, a suitably defined large-n scaled limit of the z-measures governs the spectral 
decomposition of into irreducibles: |Bor01a| §3], [OlsOSbj . 

The mixed z-measure Mz^z',£, on Y with admissible parameters {z, z') and an ad- 
ditional parameter ^ G (0, 1) is obtained by mixing up the z-measures with varying 
superscript n by means of a negative binomial distribution on Z_|_: 

M„,,(A) = (1 -{)"'^^^M^A/;W>(A) = (1 (^1^)', 

where A ranges over Y. This procedure is similar to poissonization of the Plancherel 
measure and serves the same purpose of facilitating the study of limit transitions. 
Note that the poissonized Plancherel measure M,y is a degeneration of M^^^/^^ when 
z,z' oo and ^ — )■ in such a way that zz'C, — )■ z^. 

Theorem 4.1. Under the correspondence A — )■ C{X) defined by (13.51) . the mixed 
z-measure Mz^z',i turns into a determinantal point process on the lattice U whose 
correlation kernel can he explicitly expressed through the Gauss hypergeometric func- 
tion. 

This is a generalization of Theorem 13.31 Various proofs have been given in 
[BorOOaj . [BorOOcj . [OkoOlbj . |Bor06j . 

For the lattice determinantal process from Theorem 14 . 1 1 there are three interesting 
limit regimes, as ^ — ?■ 1, leading to continuous and discrete determinantal processes: 

(1) Split Z' into positive and negative parts, Z' = Z^ U Z'_. Given A G Y, let 
£°(A) C Z' be obtained from C{X) by switching from particles to holes on 
Z'_; then C°{X) is finite and contains equally many particles in Z^ and in 
Z'_. Note that this particle/hole involution does not affect the determinantal 
property. Next, scale the lattice Z' making its mesh equal to small parameter 
6 = 1 — C,- Letting ^ — )■ 1, one gets in this way from Mz^z',s, a determinantal 
process living on the punctured real line M \ {0}. The corresponding corre- 
lation kernel is called the Whittaker kernel, because it is expressed through 
the classical Whittaker function. This limit process is of great interest for 
harmonic analysis on the infinite symmetric groups. For more detail, see 
IBorOOaj . |Ols03b] . 

(2) No scaling, we remain on the lattice. The limit determinantal process is 
directed by a diffuse measure on the space {0, 1}^ of all lattice point config- 
urations, and the limit correlation kernel is expressed through Euler's gamma 
function, see |Bor05c] . 
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(3) An "intermediate" limit regime assuming a scaling. It leads to a stationary 
limit process whose correlation kernel is expressed through trigonometric 
functions and is a deformation of the sine kernel, see |Bor05c] . 

These three different regimes describe the asymptotics of the largest, smallest and 
intermediate Frobenius coordinates of random Young diagrams, respectively. 

Remark 4.2. Note a special role of the quantity dimA/|A|! in the expression for 
Mz^z',i'- this is a Vandermonde-like object, which creates a kind of log-gas pair in- 
teraction between particles from the random configuration £(A) (about log-gas sys- 
tems, see |ForlOaj . |ForlOb] ). The particle/hole involution £(A) — )• C°{\) changes 
the sign of interaction between particles on the different sides from 0, so that we get 
two kinds of particles which are oppositely charged. Note that in the first regime, 
the particle/hole involution is necessary for existence of a limiting point process. 
The Whittaker kernel is an instance of a correlation kernel which is symmetric with 
respect to an indefinite inner product. 

4.2. Special instances of z-measures. (a) Meixner and Laguerre ensembles. 
Assume z = = 1, 2, . . . and z' = N -\-b — l with b > 0; these are admissible values. 
Then M^^^/^^ is supported by the subset Y(A^) C Y of Young diagrams with at most 
A^ nonzero rows. Under the correspondence 

Y{N) 3 X ^ ih, ^2, . . . , /jv) = (Ai + A^ - 1, A2 + AT - 2, . . . , Aiv) C Z+, 

the measure turns into a random-matrix- type object: the A^-particle Meixner or- 
thogonal polynomial ensemble with the discrete weight function (6);^'//!, where 
the argument / ranges over (for generalities about orthogonal ensembles, see 
|Kon05j ). It follows that for general values of {z,z'), the measure M^^z',^ may be 
viewed as the result of analytic continuation of the Meixner ensembles with respect 
to parameters A^ and b. This observation is exploited in |Bor06j . In a scaling limit 
regime as ^ — )• 1, the A^-particle Meixner ensemble turns into the A^-point Laguerre 
ensemble; the correlation kernel for the latter ensemble is a degeneration of the 
Whittaker kernel, see |BorOOa] . 

(b) Generalized permutations . Recall that the Plancherel measure M(") on Y„ 
coincides with the push-forward of the uniform measure on S'„ under the projection 
Sn — )■ Y„ afforded by the Robinson-Schensted correspondence RS (Sectior iS^ . 
Here is a generalization: 

Fix natural numbers N < N' and replace Sn by the finite set S*^^^, consisting of all 
N X N' matrices with entries in such that sum of all entries equals n. Elements of 

(n) 

j^, are called generalized permutations. Knuth's generalization of the Robinson- 
Schensted correspondence (the RSK correspondence, see, e.g., |Ful97t Section 4.1]) 
provides a projection of iS}^^^, onto Y„(A^) := Y„ n Y(A^), the set of Young diagram 
with n boxes and at most A^ nonzero rows. It turns out that the push-forward of 
the uniform distribution on S'j^^^, coincides with the z- measure M^jy, , see |Bor01aj . 
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(c) A variation. In the same way one can get the mixed z- measure Mn,n',£, if 
instead of 5"^]^, one takes N x N' matrices whose entries are i.i.d. random variables, 
the law being the geometric distribution with parameter ^. 

(d) Random words. Denote by 5'jj*|^ the set of words of length n in the alphabet 

[A^] := {1, . . . , N}. Endowing 5"^!^ with the uniform measure we get a model of 
random words. This model may be viewed as a degeneration of the model of random 
generalized permutations (item (b) above) in the limit N' ^ oo (this explains the 
notation S^j^^^). The RSK correspondence (or rather its simpler version due to 
Schensted) provides a projection S*^)^ — )■ Y„(A^) taking random words to random 

Young diagrams A G Y„(A^) with distribution M^]^ := \imis[>^oo ^n^n'- Asymptotic 
properties of random words are studied in [TraOlj and [JohOlaj . The model of 
random words can be generalized by allowing non-uniform probability distributions 
on the alphabet (see [ItsOlj and references therein). As explained in [ItsQ l], this more 
general model is connected to the Schur measure discussed in Section 14.31 below. 

(e) The Charlier ensemble and the Plancherel degeneration. Poissonization of 
the measure M^]^ with respect to parameter n leads to the A^-particle Charlier 
ensemble [BorOlat §9]. Alternatively, it can be obtained as a limit case of the mixed 
z-measures M]si^N',e,- The poissonized Plancherel measure appears as the limit 
of the mixed z-measures Mz^z',i when 2;' — )• oo and ^ — )■ in such a way that 
zz'^ — >■ V. This fact prompted the derivation of the discrete Bessel kernel (Theorem 
13. 3p in [BorOOb] . Alternatively, My can be obtained through a limit transition from 
the Charlier or Meixner ensembles; this leads to another derivation of the discrete 
Bessel kernel: [JohOlaj . [JohOlbj . 

4.3. The Schur measures. Let A denote the graded algebra of symmetric func- 
tions. The Schur functions sa, indexed by arbitrary partitions A G Y, form a dis- 
tinguished homogeneous basis in A. As a graded algebra, A is isomorphic to the 
algebra of polynomials in countably many generators; as these generators, one can 
take, for instance, the complete homogeneous symmetric functions hi,h2, . . . where 
deghk = k. One has sx = det[hx-_i+j] with the understanding that Hq = 1 and 
hk = for A; < (the Jacobi-Trudi formula); here the order of the determinant can 
be chosen arbitrarily provided it is large enough. For more detail, see, e.g., |Sag01| . 

Given two multiplicative functionals ip,tp: A ^ C, the corresponding (complex- 
valued) Schur measure M^^^ on Y is defined by 

M^,^{X) = const^-^ </)(sa)V'(sa), A G Y, const = ^ Lp{sx)'ip{sx), 

AeY 

provided that the sum is absolutely convergent (which is a necessary condition on 
^,ip). This notion, due to Okounkov [OkoOlaj . provides a broad generalization of 
the mixed z-measures. Since a multiplicative functional is uniquely determined by 
its values on the generators h^, the Schur measure has a doubly-infinite collection of 
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parameters {ip{hk),4'{hk); k = 1,2, . . .}. In this picture, the z-measures correspond 
to a very special collection of parameters 

and the poissonized Plancherel measure M,^ appears when ip{hk) = ^{hk) = v^^'^ /k\. 

As shown in [OkoOlaj . Theorem 14.11 extends to Schur measures: if the param- 
eters are such that the measure M^p^^ is nonnegative (and hence is a probability 
measure), then it gives rise to a lattice determinantal point process. Moreover, for 
the corresponding correlation kernel one can write down an explicit contour integral 
representation [BorOOcj . Such a representation is well suited for asymptotic analysis. 

If Lf and ip are evaluations of symmetric functions at finitely many positive vari- 
ables, the first row Ai can be interpreted as the last passage percolation time in a 
suitable directed percolation model on the plane, see | JohOSj . 

4.4. Some generalizations. Kerov |KerOOj generalized the construction of the z- 
measures M^"^, by introducing an additional parameter related to Jack polynomials. 
This new parameter is similar to the /3 parameter in random matrix ensembles 
[ForlObj . In particular, the Plancherel measure M("), which is a limit case of the 
z-measures, also allows a /3- deformation |KerOO] . |Oko05] . |Oko06] . The ordinary 
z-measures correspond to the special value (3 = 2, and in the limit /3 — )■ the beta 
z-measures degenerate to the measures (12.51) derived from the Ewens measures, see 
[OlslOl Section 1.2]. Thus, the (3 parameter interpolates between the models of 
Section [2] and those of Sections [SHU as has been pointed out in the end of Section [H 
Note also that replacing the Schur functions by the Jack symmetric functions leads 
to a natural /3-deformation of the Schur measures. 

As in random matrix theory, the value /3 = 2 is a distinguished one, while in the 
general case /9 > the situation is much more complex. Some results for /3 7^ 2 can 
be found in |Bor05bj . jPuIOi] . |UlslOj . [StrlOaj . [StrlObj . 

In a somewhat different direction, one can define natural analogues of the Plancherel 
measure and Schur measures for shifted Young diagrams (equivalently, strict par- 
titions): |Tra04] . |Mat05] . This theory is related to Schur's Q-functions (a special 
case of Hall-Littlewood symmetric functions that appears in the theory of projective 
representations of the symmetric group). Surprisingly enough, a natural analogue 
of the z-measures for shifted diagrams, discovered by Borodin and recently studied 
in |PetlOj . seems to be not related to Schur's Q-functions. 

Finally, note that there are many points of contact between the results described 
in this chapter and Fulman's work on "random matrix theory over finite fields" , see 
his survey |Ful01] and references therein. 
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